Linkage disequilibrium matches forensic genetic records to disjoint genomic marker sets.

نویسندگان

  • Michael D Edge
  • Bridget F B Algee-Hewitt
  • Trevor J Pemberton
  • Jun Z Li
  • Noah A Rosenberg
چکیده

Combining genotypes across datasets is central in facilitating advances in genetics. Data aggregation efforts often face the challenge of record matching-the identification of dataset entries that represent the same individual. We show that records can be matched across genotype datasets that have no shared markers based on linkage disequilibrium between loci appearing in different datasets. Using two datasets for the same 872 people-one with 642,563 genome-wide SNPs and the other with 13 short tandem repeats (STRs) used in forensic applications-we find that 90-98% of forensic STR records can be connected to corresponding SNP records and vice versa. Accuracy increases to 99-100% when ∼30 STRs are used. Our method expands the potential of data aggregation, but it also suggests privacy risks intrinsic in maintenance of databases containing even small numbers of markers-including databases of forensic significance.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

The Pattern of Linkage Disequilibrium in Livestock Genome

Linkage disequilibrium (LD) is bases of genomic selection, genomic marker imputation, marker assisted selection (MAS), quantitative trait loci (QTL) mapping, parentage testing and whole genome association studies. The Particular alleles at closed loci have a tendency to be co-inherited. In linked loci this pattern leads to association between alleles in population which is known as LD. Two metr...

متن کامل

The Impact of Different Genetic Architectures on Accuracy of Genomic Selection Using Three Bayesian Methods

Genome-wide evaluation uses the associations of a large number of single nucleotide polymorphism (SNP) markers across the whole genome and then combines the statistical methods with genomic data to predict the genetic values. Genomic predictions relieson linkage disequilibrium (LD) between genetic markers and quantitative trait loci (QTL) in a population. Methods that use all markers simultaneo...

متن کامل

Effects of Marker Density, Number of Quantitative Trait Loci and Heritability of Trait on Genomic Selection Accuracy

The success of genomic selection mainly depends on the extent of linkage disequilibrium (LD) between markers and quantitative trait loci (QTL), number of QTL and heritability (h2) of the traits. The extent of LD depends on the genetic structure of the population and marker density. This study was conducted to determine the effects of marker density, level of heritability, number of QTL, and to ...

متن کامل

Reliability of genomic predictions across multiple populations.

Genomic prediction of future phenotypes or genetic merit using dense SNP genotypes can be used for prediction of disease risk, forensics, and genomic selection of livestock and domesticated plant species. The reliability of genomic predictions is their squared correlation with the true genetic merit and indicates the proportion of the genetic variance that is explained. As reliability relies he...

متن کامل

Association mapping for resistance to powdery mildew in oriental tobacco (Nicotiana tabaccum L.) germplasm

Powdery mildew caused by Erysiphe cichoracearum is an important fungal disease which threatens tobacco (Nicotiana tabacum L.) production. The objective of this study was to determine DNA markers linked to genomic regions associated with resistance to powdery mildew in tobacco through the association mapping approach. Seventy tobacco geno-types were fingerprinted using 26 simple se-quence repeat...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:
  • Proceedings of the National Academy of Sciences of the United States of America

دوره 114 22  شماره 

صفحات  -

تاریخ انتشار 2017